首页> 外文OA文献 >Robust Probabilistic Predictive Syntactic Processing
【2h】

Robust Probabilistic Predictive Syntactic Processing

机译:鲁棒概率预测句法处理

摘要

This thesis presents a broad-coverage probabilistic top-down parser, and itsapplication to the problem of language modeling for speech recognition. Theparser builds fully connected derivations incrementally, in a single pass fromleft-to-right across the string. We argue that the parsing approach that wehave adopted is well-motivated from a psycholinguistic perspective, as a modelthat captures probabilistic dependencies between lexical items, as part of theprocess of building connected syntactic structures. The basic parser andconditional probability models are presented, and empirical results areprovided for its parsing accuracy on both newspaper text and spontaneoustelephone conversations. Modifications to the probability model are presentedthat lead to improved performance. A new language model which uses the outputof the parser is then defined. Perplexity and word error rate reduction aredemonstrated over trigram models, even when the trigram is trained onsignificantly more data. Interpolation on a word-by-word basis with a trigrammodel yields additional improvements.
机译:本文提出了一种涵盖面广的概率自上而下的解析器,并将其应用于语音识别的语言建模问题。分析器通过从左到右遍历整个字符串逐步递增地建立完全连接的派生。我们认为,从心理语言的角度出发,我们采用的解析方法是一种动机,它是一种捕获词汇项之间的概率依存关系的模型,是建立连接的句法结构的过程的一部分。提出了基本的解析器和条件概率模型,并针对报纸文本和自发电话对话的解析准确性提供了实验结果。提出了对概率模型的修改,从而改进了性能。然后定义一个使用解析器输出的新语言模型。在三字组合模型上也证明了困惑性和单词错误率的降低,即使在对三字组合进行大量数据训练时也是如此。使用trigrammodel在逐个单词的基础上进行插值会产生其他改进。

著录项

  • 作者

    Roark, Brian;

  • 作者单位
  • 年度 2001
  • 总页数
  • 原文格式 PDF
  • 正文语种 {"code":"en","name":"English","id":9}
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号